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Abstract —In this paper, we study the symmetry of polar 
codes on symmetric hinary-input discrete memoryless channels 
(B-DMC). The symmetry property of polar codes is originally 
pointed out in Arikan’s work for general B-DMC channels. 
With the symmetry, the output vector (N he the block 
length) can be divided into equivalence classes in terms of their 
transition probabilities. In this paper, we present a new frame of 
analysis on the symmetry of polar codes for B-DMC channels. 
Theorems are provided to characterize the symmetries among 
the received vectors. With this new perspective, we can fnlly 
utilize the property of the underlying channels and reduce the 
number of equivalence classes. The analysis is applied to binary 
symmetric channels and shows a great reduction of the number of 
equivalence classes compared with the original symmetry setting. 

I. Introduction 

Polar codes are invented by Arikan in HI. Polar codes are 
the first class of codes that can achieve the capacity for sym¬ 
metric binary-input discrete memoryless channels (B-DMC) 
with a low complexity. The encoding and decoding (with 
successive cancellation, SC) has a complexity of 0(A^log A^) 
m. The binary input alphabet in Arikan seminal work 12 
is later on extended to non-binary input alphabet El-11. 
The construction of polar codes are reported in Q-® and 
different procedures are proposed assuming the original 2x2 
kernel matrix. Polar codes based on the kernel matrices of size 
I X I are studied in I®. In this paper, we study the original 
binary-input coding scheme. 

Let VU be a binary discrete memoryless channel (B-DMC). 
Let N be the code block length. The transition probability of 
the Ah split channel is determined by the received vector (with 
a length N), and the previously decoded i — 1 bits, resulting 
in a number of possible outputs of where y is the 

output alphet. In |j2, Arikan showed that if W is symmetric, 
then the split channel is also symmetric and the number of 
possible outputs for the Ah split channel can be reduced to 
|3^|*. The received vectors with the same transition probability 
form an equivalence class. The construction of polar codes in 
0-® did not fully utilize the property of the symmetry of 
polar codes as we just describe simply because the number 
|3^|* is still large for large block lengths. 

In this paper, we study the further symmetry of polar codes 
for B-DMC channels. We show that finding the number of 
classes of received vectors with the same transition probability 
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can be transformed into a problem of finding the number 
of solutions to a set of equations. Given the symmetry of 
the underlying channel W, the number of solutions to the 
set of linear equations can be greatly smaller than |3^|* for 
the Ah split channel. We provide theoretical analysis on how 
to find the equivalent received vectors. The exact number of 
equivalence classes for a set of split channels are also provided 
in closed form expressions. The symmetry properties of polar 
codes in this paper are applied to binary symmetric channels 
(BSC). The comparison between the number of equivalence 
classes obtained in this paper and the original number 2* for 
the Ah split channel shows that our frame of analysis can be 
conveniently applied when trying to explore the symmetries 
of polar codes for potential uses. 

Eollowing the notations in 0, in the paper, we use to 
represent a row vector with elements {vi,V 2 , We also 

use V to represent the same vector for notational convenience. 
Given a vector the vector vj is a subvector 

with 1 < i,j < N. If there is a set G {1,2, then 

denotes a subvector with elements in {vi,i G .4}. 

The rest of the paper is organized as follows. In Section 
im the symmetry of polar coding is introduced. Section [HI] 
introduces theorems used in this paper for characterizing the 
equivalence classes. Then theorems and procedures on how to 
find the exact number of equivalence classes are presented in 
Section |IV] Concluding remarks are presented in Section |Vl] 

II. Symmetry OF Polar Codes 

In this section, we briefly restate the symmetry of polar 
codes introduced in 0 and present some of our own notations 
used in sequel of the paper. 

Denote the transition probability of the underlying channel 
W as W{y\x). Define 

K) = W^{y^\u^GN) (1) 

where {y^\u^GN) is the A^-time uses of the underlying 
channel W, the block length N = 2”, and Gn = = 

F^^B. The matrix B is the bit reverse permutation matrix, 
and L = [ j 5] 0. In this paper we use Gn = A"®" directly 
instead of Gat = BF®”-. The only difference Gn = BF®'^ 
makes, compared with Gn = F®”, is to perform a permu¬ 
tation on the received vector y^, which doesn’t affect our 
analysis. Note that throughout this paper, the input vector , 


the matrix F = J]’ generator matrix Gn are all 

defined on the binary field F 2 . The output is defined on 
the set y. 

The operation in O is the channel combining stage. In 
the channel split stage, the ith split channel has a transition 
probability of 

E (2) 

To facilitate the analysis, the operator • is defined as: 


0 ■ y=y, y (3) 

1 • y=v, y,v Gy (4) 

and V satisfies 

Wiv\0)=Wiy\l) (5) 

W{v\l)=W{y\0) (6) 

In other words, v is the corresponding symbol that is sym¬ 
metric to y. To divide the output alphabet into two groups, we 
also define two sets: 

SELF(3;)={y Gy\l-y = y} (7) 

SYMM(3;)={y Gy\l-y^y} (8) 


Set SELF (3^) includes the output y which has a symmetric 
symbol of itself while set SYMM(3^) has elements with a 
different symmetric symbol. Obviously, the cardinality of the 
set SYMM(3^), I SYMM(3^)|, is an even number. As shown in 
im , for symmetric B-DMC channels, it is sufficient to discuss 

wj;\y^,0\-^\0)= E 

m (9) 

as every W^\yi,u\ ^\ui) is equivalent to a 

0*i“^|0) where y^ is selected from the following 
set: 

Yi+i = {yf : of Gtv • yi , and a\ = u\} (10) 

In (fTOl) . the operation GN-yi is the element-wise operation 
of the • operation defined in @ and (|4li. In this paper, we will 
simply use lE^^(y),y G y^ to replace as 

the previous bits and the current bit are all zeros. And if two 
received vectors y and v have the same transition probability 

w^N{y) = w^N\^) ( 11 ) 

then we call y and v probability equivalent on bit channel 
i. Seen from (fTOl i. if y and v are probability equivalent on 
bit channel i, they are also probability equivalent on the row 
space of the submatrix Gfj^^, where Gfj^^ means the rows of 
Gn from the (i -f l)th row to the A^th row. 


III. Characterization of the Symmetry 
For the convenience of the description, we denote the 
submatrix Gf^i of the matrix Gn as A{N,i)- Then A(2N,i) 
is the submatrix formed by the rows from (i -f 1) to 2N of 
G 2 N = For a given block length N, we abbreviate 

A{N, i) = G^i as A = Gf^i since the size of A is clear 
from the context. Denote R(A) as the row space of A. For 
the received vector y G y^, define 

R(A).y = {u.y| uGR(A)} (12) 

In the following, we use a simpler notation to refer to the 
set defined in drill : R(A, y) = R(A) • y. Seen from (l9]l, the 
transition probability for the ith channel is exactly summed 
over the row space of A. Using the notation W^\y) = 
VFj^l(y, 0*j“^|0), we can rewrite (|9ll as 


w^\y)= E 

2 Ar-i(^N(y|u) 

(13) 

uGR(A) 


= E 

2 Ar-i(^N(u-y| 0 f) 

(14) 

uGR(A) 


= E 


(15) 


v£R(A,y) 


The equality in (fT4l i is obtained from the application of the 
symmetry property (O and (|4|i of the underlying channel W. 

In the following, we present theorems to characterize the 
properties of the received vectors which belong to the same 
equivalence class. With our frame of analysis, the original 
symmetry property presented in HI is also provided. 

Theorem 1: Let P be a permutation matrix. Denote Tp(-) 
as the permutation transformation. If there exists an invertible 
matrix H so that AP = HA, then v = Tp(y) and y are 
probability equivalent on A, where A is A{N,i). 

Proof: Since the matrix P is a permutation matrix, the 
inverse matrix of it is P^. Thus the transformation Tp is 
a bijection. Let’s consider the two spaces R(A, Tp(y)) and 
R(AP, Tp(y)). Obviously the row space of A and HA is the 
same, so the row space of A and AP must also be the same, 
since AP = HA and H is invertible. For each z G R(A, y), 
there exists a Tp(z) G R(AP, Tp(y)) = R(A, Tp(y)) and 
vice versa. Since 

lE^(z|0) = lE^(Tp(z)|0) (16) 

y and Tp (y) must be probability equivalent on A for the Ah 
channel. ■ 

The following corollary can be easily obtained from The¬ 
orem [T] if we assume the matrix A is an invertible matrix. 

Corollary 1: The vectors y,Tp(y) G y^ are always 
probability equivalent on an invertible matrix, where P is a 
permutation matrix. 

Note that the matrix A = Gfj^i can’t be an invertible 
matrix for any bit channel i. But if A can be decomposed 
into several small matrices and each of these small matrices is 
row equivalent to an identify matrix, then this Corollary [T] can 







be used to characterize the symmetry of the received vectors 
operated on those small matrices. 

Theorem 2: If there exists a vector u G _ 

uA, then v = k • y and y are probability equivalent on bit 
channel i. 

Theorem |2] is equivalent to the symmetry property in [I] . 
Please refer to III for the proof there. 

Corollary 2: For BSC channels, the number of equivalence 
classes for the *th bit channel can be calculated from y G 
and = 0. 

Proof: For a BSC channel, the operation in Q and (|4]i 
becomes 0 ■ y = y and 1 • y = y © 1. Also note that columns 
from i + 1 to of the matrix A form an independent set 
as the matrix Gn is lower triangular. Therefore, for any y^ 
defined in the binary field F 2 , we can find a unique vector 
u G such that the vector k = uA and = y^+i- 

This makes the vector v = uA • y have From 

Theorem |2] it’s seen that v and y are probability equivalent 
on the ith bit channel. Thus the equivalence classes for BSC 
channels can be investigated based on the received vector 
with which results in 2® as the largest possible 

number of equivalence classes. ■ 

The following Theorem [2 states that the equivalence of two 
vectors for the ith bit channels is preserved when the block 
length increases from N to 2N. 

Theorem 3: If vector y and v are probability equivalent on 
A{N, i) = A, then u = (1, 0) © y and t = (1, 0) © v are also 
probability equivalent on A{2N,i)- 

Proof: Define B = (1,0) © A, C = A{2N, i), and D = 
A{2N, N). The matrix D is row equivalent to (1,1)©/Ar from 
the definition of the generator matrix Gn- It’s clear that the 
row space of the matrix B, D are subsets of the row space 
of matrix G\ R(f3) C R(C'), R(f?) C R(C'). If y and v are 
probability equivalent on A, there exists a bijection Tsuch 
that 

FF'^(h|0) = R^^(Tp,(fi)|0) (17) 

where h G R{A,y), Tp^(/i) G i?(A, v), and Ph is a 
permutation matrix that may vary with h. Then u and t 
must also be probability equivalent on B, since there exists 
a bijection / such that 

/:hGi?(B,u)->hihGR(S,t), Lh = (l,l)0^’h (18) 
Now, there’s a partition of t), 

Par(t) = {R{D) • r| r G i?(R, t)} (19) 

The same for R{C, u), 

Par(u) = {R{D) • r| r G i?(R, u)} (20) 

For any e G R{G, u), it must belong to some R{D) ■ r G 
Par(u). According to Theorem [T] r G R{B,u) and /(r) G 
R{B,t) are probability equivalent on R{D), since D and 
DLh are row equivalent. It follows that there exists a bijection 
er(e) G R(79) • /(r) C R{G,t) that satisfies e^(e) = ePe- 
Therefore, u and t are probability equivalent on A(2N,i). ■ 


Corollary 3: For BSC channels (which means y = F 2 ), 
the number of different transition probabilities for bit channel 
i doesn’t change with the block length N . Though the value of 
the transition probability kF^^(y) and kF 2 ]^((l, 0 ) © y) may 
not be the same for different block lengths. 

IV. Evaluation of the Number oe Equivalence 
Classes 

In Section |III] the symmetry property is introduced which 
can be used to determine if any two outputs have the same 
transition probability for bit channel i. In this section, we 
analyze the exact number of different transition probabilities 
for bit channel i. In [I], this is the problem to find the number 
of equivalence classes. 

A. Transformation of the Output Alphabet 

In this section, we transform the study of the symmetry 
based on the output y G to another output alphabet, which 
is more convenient to manipulate. 

Before Theorem |4] is introduced, some new notations are 
needed. We use A, •) to denote the problem of studying 
the symmetry based on the transition probability in equation 
O. Denote = |SELF(V)| and S 2 = |SYMM(V)|. 
Remember that SELF(V) and SYMM(V) are defined in 0 
and IDl, respectively. Consider the received vector y G y^ and 
we count the number of occurrences of all possible symbols 
within this received vector. Suppose qt is the number of oc¬ 
currences of the tth symbol of SELF(V) and et is the number 
of occurrences of the fth symbol of SYMM(V)- Without loss 
of generality, assume the corresponding symmetric symbol of 
the tth symbol is its (S '2 + 1 — f)th symbol. That is 

lE(st|0) = VF(ss,+i-t|l),st G SYMM(V),1 < t < S 2 

( 21 ) 

Define a new mapping as 

y'(y) = (22) 

It’s seen that y'(y) is well defined in the sense that for any y, 
the mapping y'(y) is unique. Define a new set corresponding 
to y^ as 

y={y'(y)l (23) 

SELF(y)= 

{y'(y)l et=es 2 +i-tA <t<S 2 ,y€ v"} (24) 

SYMM (y) =y - SELF (y) (25) 

If z = (gi,...,gsi,ei,...,es 2 ) G y, define * as 

0 * z=z (26) 

l*z=(9i,---,<?Si,es2,...,ei) (27) 

Obviously 

SELF(y) = {z G y I z = 1 * z} (28) 


Corollary 4: From the definition of the set y', the cardi¬ 
nality of it, is the number of solutions to the following 
equations; 


Sl<t<S2 “*■ Sl<t<Si ^ 

et > 0 , qt > 0 


(29) 


The number of solutions to this equation is |3^'| = 

S 2 +Si-i^)’ which is proved in the Appendix. The size of 
the set SELF(3^'), | SELF(3^')|, is the number of solutions to 
the following equations; 


f Si<t<S 2 J2i<t<Si = N 

% e* = es 2 +i-t, ^ <t < S 2 (30) 

[et > 0 , gt > 0 

The number of solutions of (EOjl, which we will denote as 
NF(S'i, 52 , A^) is; when 5i ^ 0 


|SELF(y)| 


E 

0 <r<N /2 


/r + S 2/2 - 1\ 

I ^2/2-1 J 


and when 5i = 0 


/N-2r + Si- A 

I - 1 ) 

(31) 


|SELF(y)| 


/N/2 + S 2/2 - 1\ 

V S 2/2 -1 ) 


(32) 


Note that, when = 1, | SELF(y)| = 0. The proof of the 
solution is omitted in this paper due to the space limit. And 
the size of the set SYMM(y) is 


I sYMM(y)| = |y I -1 sELF(y)| ( 33 ) 


which will be denoted as NM(5i, S 2 , N). 

Applying Corollary 0] to the BSC channel, we can obtain 
the following corollary. 

Corollary 5: If 5 i = 0, ^2 = 2, then |SELF(y)| = 1 , 
|SYMM(y)| =N. 


B. Evaluation of the Number of Equivalence Classes 

In this section, we use the transformed output alphabet in 
Section IIV-AI to evaluate the exact number of equivalence 
classes. We first provide the following theorem. 

Theorem 4: For bit channel i, we denote the number of 
the remaining rows of the generator matrix Gn as a = N — i 
(N = 2”). Let a' be any number in the range N > a' > a and 
a' = 2^ for some number k'. Denote the number of different 
transition probabilities of bit channel i as NC{N,i, Si, S 2 ), 
where Si = |SELF(3;)| and S 2 = |SYMM(3;)|. Then the 
number of different transition probabilities for bit channel i is 
equivalent to NC(W, i, 5i, ^ 2 ) = NC(a', i — (A^ —o'), ^ 3 , ^ 4 ), 
where S^ = NF(5i, ^ 2 , N/a'), S^ = NM(5i, ^ 2 , N/a'). 
Proof: We know 

_ p 0 n-k' igj p 0 k' 

The last a' rows are (1,1,...1)® F®* . Define vector z(y) G 
ya' 

z(y){j} = + a',j + 2a'...j + N-a'}),0 < j < a' 

(35) 


where y is the output vector. Here y{j} means the jth entry of 
y. Look at the definition of y', it’s seen that z is surjective. 
For any given y we can get a unique z(y). Also, from the 
definition of y', (y) can be calculated from z(y). Observe 


z((( l,l,^...,lj 0 v) • y) = z(y) * V 

N/a' 


(36) 


where v G R(A(a',i — {N — a'))). Then {y^, A{N 
is equivalent to (y“ ,A{a',i — {N — a')),*). So we get 
NC(y i, Si,S 2 ) = NC(a', t-{N- a'), S 3 , S 4 ). ■ 

Theorem 5: 


NC(W,0,5i,52) < 


/N + S 2/2 + 5i - A 
52/2 + 5 i -1 ) 


(37) 


Obviously, this is not the number of transition probabilities 
for any bit channel i. However, it can be used to deduce the 
number of transition probabilities for some bit channnels. 
Proof: Since rank{A{N,0)) = N, obviously; 


R(A(y0))=F^ (38) 


Define S C SYMM(3^) as a set that satisfies |5| = S 2 / 2 , 
and Vt/i, y 2 C S, 1 ■ yi t/ 2 - Then for any y G y^, we 
can have a v = u • y, u G F^, v G y^, so that elements of 
V will only choose from S U SELF(3^). Since any element 
has 52/2 + 5i choices and the order doesn’t matter, so there’s 
total most. ■ 

For some cases, we can have more accurate results than The- 
orem|5] Define 0- lA = logW{yi\0), 1 • lA = logW{l ■ yi\0), 
and Di = 1 ■ Wi — 0 ■ lA, then we have 

Theorem 6 : If A^s are all different for all i, then 


NC(W,0,5i,52) 


fN + S2/2 + 5 i - A 
V 52/2 + 5i-1 ) 


(39) 


Proof: 


log{W^{y\a.))= ^ ai-lA,aGF^ (40) 

l<i<N 

=log{W^{y\0^))+ (41) 

l<i<Af 


If y and v are probability equivalent, there exists u G F^ so 
that lA'^(u-y|0) = lA'^(v|0). Define V 2 = u-y. We can see 
that V 2 and v must have the same set of {Dy}, which means 
V 2 must be a permutation of v. This shows that the symmetric 
properties have been discovered (Theorem [T| and Theorem |2| 
are enough to find all equivalent output vectors. So the less 
equal sign in Theorem |5] can be replaced by an equal sign. ■ 
Also, if bit channel i can be converted to a channel with 
i' = 0 , then we can have an exact evaluation. 

Corollary 6 : If 5i = 0, ^2 = 2, then for i = N — a, where 
a is power of 2 , 

/Y /o + —\ 

NC(lV,t,5i,52) = NC(a,0,l,-)= (42) 

“ V 23 : / 

This is a numeric result on BSC channels. It provides the exact 
number of different transition probabilities for bit channel i. 




For N = 2", we choose a = to calculate the number 

of equivalence classes. When n is even NC(7V, i, 0, 2) = 


_ (^\VNoiVN 

. £ J m!(VNyr^2^ 


(43) 


smallest number of different transition probabilities for each 
bit channel. For some of the bit channels, an exact evaluation 
of the number of different transition probabilities (or the 
number of equivalence classes) is provided. The analysis for 
any given bit channel is still under investigation. 


and when n is odd NC(N, i, 0, 2) = 

^ ^ 2 \/ 2 lv 

V I > ((yf)!)^~ \/f 


(44) 


Although these are quite large numbers, they are still much 
smaller than 2* = 2 ^““ = obtained using the 

symmetry in ITJ. 


V. Numerical Results 

To test Corollary|6l we numerically calculated the number of 
different transition probabilities for all y S F^. The 

numerically calculated results match the numbers calculated 
with Corollary |6]up to * = 16. For larger ts, the time consumed 
to calculate the number of different transition probabilities is 
too long to be practical. From the table, we see for the BSC 
channels, the number of equivalence classes can be greatly 
s maller than 2^ for the ith channel. 


i 

Result 

i 

Result 

2 

3 

128 

129 

3 

3 

192 

2,145 

4 

5 

224 

58,905 

6 

6 

240 

7.35-105 

7 

5 

248 

7.35-105 

8 

9 

252 

58,905 

12 

15 

254 

2,145 

14 

15 

255 

129 

15 

9 

256 

257 

16 

17 

384 

8,385 

24 

45 

448 

8.14-105 

28 

70 

480 

7.69-10^ 

30 

45 

496 

6.71 - I05 

31 

17 

504 

7.69-10^ 

32 

33 

508 

8.14- I05 

48 

153 

510 

8,385 

56 

495 

5II 

257 

60 

495 

512 

513 

62 

153 

768 

33,153 

63 

33 

896 

1.21 - 10^ 

64 

65 

960 

1.2 - I0i° 

96 

561 

992 

7.68-105 

II2 

4,845 

1,008 

7.68-105 

120 

12,870 

1,016 

1 .2- I0i° 

124 

4,845 

1,020 

I.2I - 10^ 

126 

561 

1,022 

33,153 

127 

65 

1,023 

513 


VI. Conclusion 


In this paper, we present a frame of analysis to fully explore 
the symmetries of polar codes, which is used to analyze the 


Appendix 

Lemma 1: The number of solutions to this equation 


Si<t<S2 4" Si<t<Si It — N 

et > 0 , qt > 0 


is 


(N+Si+S2-1\ 

V S 2 +S 1-1 r 

Proof: The equation can be transformed into 


(45) 


Si<t<Si+S 2 Wt - N + Si + S 2 

Wt>l 


(46) 


which is equivalent to 


{ l<P(<W + S'i + S 2 — 1 
1 < f < S'! + S '2 - 1 
Ptl 7^ Pt2 

Pi <P2 < ... < PS 1 +S 2 -I 


(47) 


Then wt can be seen as 

/ 

Wi = Pi 

W2 =P2- Pi 

< ... (48) 

WS 1 + S 2 -I =^51+52-1 -FS 1 + S 2-2 
WS1+S2 = N + Si + S2 -PS1+S2-1 

The number of solutions to equation dSll is equivalent to 
taking + S '2 — 1 balls out of + 52 — I balls without 

considering the order. Thus the lemma is proven. ■ 
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